MPEG-7 Visual Descriptors and Discriminant Analysis
نویسندگان
چکیده
The MPEG-7 standards defines a set of descriptors to characterize the content of visual media [l, 2]. These visual descriptors, such as color and texture descriptors, have undergone extensive evaluation and development based on the application of retrieval ranking. Specifically, under query-by-example (QBE) paradigm, average normalized modified retrieval rank (ANMRR), a rate-accuracy like performance measure, is adopted to test these descriptors on image collection and predefined ground truth datasets. The experimental results show that each descriptor has good retrieval performance. However, there are some questions left to be answered in practice. How to apply visual descriptors in various applications? Does each visual descriptor have good performance in the applications besides retrieval ranking? How to combine multiple visual descriptors for a specific application? What is the performance of the aggregated visual descriptors? It would be generally accepted that a good visual descriptor should have excellent ability to separate distinct visual media content, named discriminant power. In various applications, the discriminant power of visual descriptors would be evaluated by the application-dependent performance criteria. Since the core experiments applied for the MPEG-7 standards concentrate on single visual descriptor and retrieval ranking, the discriminant power of visual descriptors has not been sufficiently evaluated. Particularly, the applications and technologies should be taken into account for evaluating the discriminant power of visual descriptors. This chapter answers the above questions from the perspective of discriminant power.
منابع مشابه
Evaluation and comparison of texture descriptors proposed in MPEG-7
Texture description contributes as one of the most important low-level features in content-based image retrieval. In MPEG-7, homogeneous texture descriptor (HTD), texture browsing descriptor (TBD), and edge histogram descriptor (EHD) have been proposed as texture descriptors. However, no comprehensive evaluation and comparison of these three descriptors have been made. In this paper, we propose...
متن کاملHow good are the visual MPEG-7 features?
The study presented in this paper analyses descriptions extracted with MPEG-7-descriptors from visual content from the statistical point of view. Good descriptors should generate descriptions with high variance, a well-balanced cluster structure and high discriminance to be able to distinguish different media content. Statistical analysis reveals the quality of the used description extraction a...
متن کاملAutomatic Image Annotation for Semantic Image Retrieval
This paper addresses the challenge of automatic annotation of images for semantic image retrieval. In this research, we aim to identify visual features that are suitable for semantic annotation tasks. We propose an image classification system that combines MPEG-7 visual descriptors and support vector machines. The system is applied to annotate cityscape and landscape images. For this task, our ...
متن کاملStatistical analysis of content - based MPEG - 7
The study presented in this paper analyses the visual MPEG-7 descriptors from a statistical point of view. A statistical analysis is able to reveal the properties and qualities of the used descriptors: redundancies, sensitivity on media content, etc. These aspects were not considered in the MPEG-7 design process where the major goal was optimising the retrieval rate. For the statistical analysi...
متن کاملNote to Users
In this work, an application system is proposed to classify American Football Video shots. The application uses MPEG-7 motion and audio descriptors along with Mel Frequency Cepstrum Coefficient features to classify the video shots into 4 categories, namely: Pass plays. Run plays, Field Goal/Extra Point plays and Kickoff/Punt plays. Fisher’s Linear Discriminant Analj^sis is used to classify the ...
متن کامل